Overview

Dataset statistics

Number of variables: 32
Number of observations: 119210
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 8161
Duplicate rows (%): 6.8%
Total size in memory: 98.7 MiB
Average record size in memory: 868.2 B
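
The percentage and per-record figures in this table follow from the raw counts. The sketch below recomputes them; it assumes that 1 MiB is 2**20 bytes and that the report rounds to one decimal place. The variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_observations = 119210
n_duplicate_rows = 8161
total_size_mib = 98.7  # already rounded in the report

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = 100 * n_duplicate_rows / n_observations

# Average record size in bytes, assuming 1 MiB = 2**20 bytes.
avg_record_bytes = total_size_mib * 2**20 / n_observations

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Because the total size is itself rounded, the recomputed record size can differ from the reported value in the last digit.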

Variable types

Categorical: 16
Numeric: 14
Text: 1
DateTime: 1

Alerts

Dataset has 8161 (6.8%) duplicate rows [Duplicates]
children is highly imbalanced (80.6%) [Imbalance]
babies is highly imbalanced (97.1%) [Imbalance]
distribution_channel is highly imbalanced (63.2%) [Imbalance]
is_repeated_guest is highly imbalanced (79.8%) [Imbalance]
reserved_room_type is highly imbalanced (56.3%) [Imbalance]
deposit_type is highly imbalanced (65.3%) [Imbalance]
customer_type is highly imbalanced (50.6%) [Imbalance]
required_car_parking_spaces is highly imbalanced (85.4%) [Imbalance]
previous_cancellations is highly skewed (γ1 = 24.44392359) [Skewed]
previous_bookings_not_canceled is highly skewed (γ1 = 23.53955539) [Skewed]
lead_time has 6264 (5.3%) zeros [Zeros]
stays_in_weekend_nights has 51895 (43.5%) zeros [Zeros]
stays_in_week_nights has 7572 (6.4%) zeros [Zeros]
previous_cancellations has 112731 (94.6%) zeros [Zeros]
previous_bookings_not_canceled has 115597 (97.0%) zeros [Zeros]
booking_changes has 101232 (84.9%) zeros [Zeros]
agent has 16280 (13.7%) zeros [Zeros]
company has 112442 (94.3%) zeros [Zeros]
days_in_waiting_list has 115517 (96.9%) zeros [Zeros]
adr has 1810 (1.5%) zeros [Zeros]
total_of_special_requests has 70201 (58.9%) zeros [Zeros]
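
The imbalance percentages in these alerts are consistent with a normalised-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the category frequencies and k is the number of categories. The sketch below works under that assumption; the function name `imbalance_score` is hypothetical and is not claimed to be the profiler's own API.

```python
import math


def imbalance_score(counts):
    """Return 1 - H/log2(k): 0 for a uniform split, close to 1 when one class dominates."""
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen for this sketch when only one category exists
    total = sum(counts)
    # Shannon entropy in bits; empty categories contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Counts from the is_repeated_guest table: value 0 -> 115455, value 1 -> 3755.
score = imbalance_score([115455, 3755])
print(f"is_repeated_guest imbalance: {100 * score:.1f}%")
```

For the is_repeated_guest counts this evaluates to about 79.8%, matching the alert, which supports the assumed formula without confirming it.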

Reproduction

Analysis started: 2024-05-29 08:33:11.419978
Analysis finished: 2024-05-29 08:33:38.955456
Duration: 27.54 seconds
Software version: ydata-profiling v4.8.3
Download configuration: config.json
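
The reported duration is the difference between the two timestamps. A minimal check with Python's standard library, using the timestamps exactly as printed in this section:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" section.
started = datetime.fromisoformat("2024-05-29 08:33:11.419978")
finished = datetime.fromisoformat("2024-05-29 08:33:38.955456")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```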

Variables

hotel
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 11.7 MiB
City Hotel: 79163
Resort Hotel: 40047

Length

Max length: 12
Median length: 10
Mean length: 10.671873
Min length: 10

Characters and Unicode

Total characters: 1272194
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
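
The character statistics for hotel can be rebuilt from the two category counts alone. The sketch below weights each character of each value by that value's frequency and looks up its Unicode general category with the standard `unicodedata` module. It illustrates the idea and is not the profiler's implementation.

```python
import unicodedata
from collections import Counter

# Value counts copied from the hotel variable.
value_counts = {"City Hotel": 79163, "Resort Hotel": 40047}

char_counts = Counter()      # occurrences of each character
category_counts = Counter()  # occurrences per Unicode general category (Lu, Ll, Zs, ...)
for value, n in value_counts.items():
    for ch in value:
        char_counts[ch] += n
        category_counts[unicodedata.category(ch)] += n

total_characters = sum(char_counts.values())
mean_length = total_characters / sum(value_counts.values())

print("Total characters:", total_characters)
print("Distinct characters:", len(char_counts))
print("Mean length:", round(mean_length, 6))
print("Categories:", dict(category_counts))
```

Here `Lu`, `Ll` and `Zs` are the Unicode abbreviations for Uppercase Letter, Lowercase Letter and Space Separator.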

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Resort Hotel
2nd row: Resort Hotel
3rd row: Resort Hotel
4th row: Resort Hotel
5th row: Resort Hotel

Common Values

Value | Count | Frequency (%)
City Hotel | 79163 | 66.4%
Resort Hotel | 40047 | 33.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
hotel | 119210 | 50.0%
city | 79163 | 33.2%
resort | 40047 | 16.8%

Most occurring characters

ValueCountFrequency (%)
t 238420
18.7%
o 159257
12.5%
e 159257
12.5%
119210
9.4%
H 119210
9.4%
l 119210
9.4%
C 79163
 
6.2%
i 79163
 
6.2%
y 79163
 
6.2%
R 40047
 
3.1%
Other values (2) 80094
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 914564
71.9%
Uppercase Letter 238420
 
18.7%
Space Separator 119210
 
9.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 238420
26.1%
o 159257
17.4%
e 159257
17.4%
l 119210
13.0%
i 79163
 
8.7%
y 79163
 
8.7%
s 40047
 
4.4%
r 40047
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
H 119210
50.0%
C 79163
33.2%
R 40047
 
16.8%
Space Separator
ValueCountFrequency (%)
119210
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1152984
90.6%
Common 119210
 
9.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 238420
20.7%
o 159257
13.8%
e 159257
13.8%
H 119210
10.3%
l 119210
10.3%
C 79163
 
6.9%
i 79163
 
6.9%
y 79163
 
6.9%
R 40047
 
3.5%
s 40047
 
3.5%
Common
ValueCountFrequency (%)
119210
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1272194
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t 238420
18.7%
o 159257
12.5%
e 159257
12.5%
119210
9.4%
H 119210
9.4%
l 119210
9.4%
C 79163
 
6.2%
i 79163
 
6.2%
y 79163
 
6.2%
R 40047
 
3.1%
Other values (2) 80094
 
6.3%

is_canceled
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.6 MiB
0: 75011
1: 44199

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 119210
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

ValueCountFrequency (%)
0 75011
62.9%
1 44199
37.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 75011
62.9%
1 44199
37.1%

Most occurring characters

ValueCountFrequency (%)
0 75011
62.9%
1 44199
37.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 119210
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 75011
62.9%
1 44199
37.1%

Most occurring scripts

ValueCountFrequency (%)
Common 119210
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 75011
62.9%
1 44199
37.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 119210
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 75011
62.9%
1 44199
37.1%

lead_time
Real number (ℝ)

ZEROS 

Distinct: 479
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 104.10923
Minimum: 0
Maximum: 737
Zeros: 6264
Zeros (%): 5.3%
Negative: 0
Negative (%): 0.0%
Memory size: 5.9 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 18
median: 69
Q3: 161
95-th percentile: 320
Maximum: 737
Range: 737
Interquartile range (IQR): 143

Descriptive statistics

Standard deviation: 106.87545
Coefficient of variation (CV): 1.0265704
Kurtosis: 1.6943723
Mean: 104.10923
Median Absolute Deviation (MAD): 60
Skewness: 1.3458092
Sum: 12410861
Variance: 11422.362
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
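
Several of the lead_time statistics are linked by simple identities: the range and IQR are differences of quantiles, the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the mean is the sum divided by the number of observations. The sketch below checks these relations using the rounded figures printed in the report, so small discrepancies in the last digits are expected.

```python
# Figures copied from the lead_time statistics and the dataset overview.
n_observations = 119210
mean = 104.10923
std = 106.87545
q1, q3 = 18, 161
minimum, maximum = 0, 737
total = 12410861

iqr = q3 - q1                        # interquartile range
value_range = maximum - minimum      # range
cv = std / mean                      # coefficient of variation
variance = std ** 2                  # variance from the standard deviation
mean_from_sum = total / n_observations

print(f"IQR: {iqr}, Range: {value_range}")
print(f"CV: {cv:.7f}, Variance: {variance:.3f}, Mean: {mean_from_sum:.5f}")
```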
ValueCountFrequency (%)
0 6264
 
5.3%
1 3445
 
2.9%
2 2065
 
1.7%
3 1815
 
1.5%
4 1710
 
1.4%
5 1563
 
1.3%
6 1444
 
1.2%
7 1329
 
1.1%
8 1138
 
1.0%
12 1079
 
0.9%
Other values (469) 97358
81.7%
ValueCountFrequency (%)
0 6264
5.3%
1 3445
2.9%
2 2065
 
1.7%
3 1815
 
1.5%
4 1710
 
1.4%
5 1563
 
1.3%
6 1444
 
1.2%
7 1329
 
1.1%
8 1138
 
1.0%
9 991
 
0.8%
ValueCountFrequency (%)
737 1
 
< 0.1%
709 1
 
< 0.1%
629 17
< 0.1%
626 30
< 0.1%
622 17
< 0.1%
615 17
< 0.1%
608 17
< 0.1%
605 30
< 0.1%
601 17
< 0.1%
594 17
< 0.1%

arrival_date_year
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 11.0 MiB
2016: 56623
2017: 40620
2015: 21967

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Characters and Unicode

Total characters: 476840
Distinct characters: 6
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2015
2nd row: 2015
3rd row: 2015
4th row: 2015
5th row: 2015

Common Values

ValueCountFrequency (%)
2016 56623
47.5%
2017 40620
34.1%
2015 21967
 
18.4%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2016 56623
47.5%
2017 40620
34.1%
2015 21967
 
18.4%

Most occurring characters

ValueCountFrequency (%)
2 119210
25.0%
0 119210
25.0%
1 119210
25.0%
6 56623
11.9%
7 40620
 
8.5%
5 21967
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 476840
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 119210
25.0%
0 119210
25.0%
1 119210
25.0%
6 56623
11.9%
7 40620
 
8.5%
5 21967
 
4.6%

Most occurring scripts

ValueCountFrequency (%)
Common 476840
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 119210
25.0%
0 119210
25.0%
1 119210
25.0%
6 56623
11.9%
7 40620
 
8.5%
5 21967
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 476840
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 119210
25.0%
0 119210
25.0%
1 119210
25.0%
6 56623
11.9%
7 40620
 
8.5%
5 21967
 
4.6%

arrival_date_month
Categorical

Distinct: 12
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 11.2 MiB
August: 13861
July: 12644
May: 11780
October: 11147
April: 11078
Other values (7): 58700

Length

Max length: 9
Median length: 7
Mean length: 5.9026927
Min length: 3

Characters and Unicode

Total characters: 703660
Distinct characters: 26
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: July
2nd row: July
3rd row: July
4th row: July
5th row: July

Common Values

ValueCountFrequency (%)
August 13861
11.6%
July 12644
10.6%
May 11780
9.9%
October 11147
9.4%
April 11078
9.3%
June 10929
9.2%
September 10500
8.8%
March 9768
8.2%
February 8052
6.8%
November 6771
5.7%
Other values (2) 12680
10.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
august 13861
11.6%
july 12644
10.6%
may 11780
9.9%
october 11147
9.4%
april 11078
9.3%
june 10929
9.2%
september 10500
8.8%
march 9768
8.2%
february 8052
6.8%
november 6771
5.7%
Other values (2) 12680
10.6%

Most occurring characters

ValueCountFrequency (%)
e 95447
13.6%
r 78048
 
11.1%
u 65268
 
9.3%
b 43229
 
6.1%
a 41442
 
5.9%
y 38397
 
5.5%
t 35508
 
5.0%
J 29494
 
4.2%
c 27674
 
3.9%
A 24939
 
3.5%
Other values (16) 224214
31.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 584450
83.1%
Uppercase Letter 119210
 
16.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 95447
16.3%
r 78048
13.4%
u 65268
11.2%
b 43229
 
7.4%
a 41442
 
7.1%
y 38397
 
6.6%
t 35508
 
6.1%
c 27674
 
4.7%
m 24030
 
4.1%
l 23722
 
4.1%
Other values (8) 111685
19.1%
Uppercase Letter
ValueCountFrequency (%)
J 29494
24.7%
A 24939
20.9%
M 21548
18.1%
O 11147
 
9.4%
S 10500
 
8.8%
F 8052
 
6.8%
N 6771
 
5.7%
D 6759
 
5.7%

Most occurring scripts

ValueCountFrequency (%)
Latin 703660
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 95447
13.6%
r 78048
 
11.1%
u 65268
 
9.3%
b 43229
 
6.1%
a 41442
 
5.9%
y 38397
 
5.5%
t 35508
 
5.0%
J 29494
 
4.2%
c 27674
 
3.9%
A 24939
 
3.5%
Other values (16) 224214
31.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 703660
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 95447
13.6%
r 78048
 
11.1%
u 65268
 
9.3%
b 43229
 
6.1%
a 41442
 
5.9%
y 38397
 
5.5%
t 35508
 
5.0%
J 29494
 
4.2%
c 27674
 
3.9%
A 24939
 
3.5%
Other values (16) 224214
31.9%

arrival_date_week_number
Real number (ℝ)

Distinct: 53
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 27.163376
Minimum: 1
Maximum: 53
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.9 MiB

Quantile statistics

Minimum: 1
5-th percentile: 5
Q1: 16
median: 28
Q3: 38
95-th percentile: 49
Maximum: 53
Range: 52
Interquartile range (IQR): 22

Descriptive statistics

Standard deviation: 13.601107
Coefficient of variation (CV): 0.5007149
Kurtosis: -0.98542287
Mean: 27.163376
Median Absolute Deviation (MAD): 11
Skewness: -0.010198696
Sum: 3238146
Variance: 184.99011
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33 3576
 
3.0%
30 3082
 
2.6%
32 3041
 
2.6%
34 3039
 
2.5%
18 2923
 
2.5%
21 2853
 
2.4%
28 2843
 
2.4%
17 2803
 
2.4%
20 2781
 
2.3%
29 2763
 
2.3%
Other values (43) 89506
75.1%
ValueCountFrequency (%)
1 1045
0.9%
2 1216
1.0%
3 1318
1.1%
4 1485
1.2%
5 1385
1.2%
6 1507
1.3%
7 2102
1.8%
8 2212
1.9%
9 2109
1.8%
10 2142
1.8%
ValueCountFrequency (%)
53 1811
1.5%
52 1187
1.0%
51 933
0.8%
50 1498
1.3%
49 1780
1.5%
48 1495
1.3%
47 1677
1.4%
46 1570
1.3%
45 1940
1.6%
44 2270
1.9%

arrival_date_day_of_month
Real number (ℝ)

Distinct: 31
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.798717
Minimum: 1
Maximum: 31
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.9 MiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 8
median: 16
Q3: 23
95-th percentile: 30
Maximum: 31
Range: 30
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 8.7810701
Coefficient of variation (CV): 0.55580908
Kurtosis: -1.1870963
Mean: 15.798717
Median Absolute Deviation (MAD): 8
Skewness: -0.0021109856
Sum: 1883365
Variance: 77.107192
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
17 4401
 
3.7%
5 4310
 
3.6%
15 4188
 
3.5%
25 4155
 
3.5%
26 4141
 
3.5%
9 4090
 
3.4%
12 4082
 
3.4%
16 4071
 
3.4%
2 4054
 
3.4%
19 4048
 
3.4%
Other values (21) 77670
65.2%
ValueCountFrequency (%)
1 3620
3.0%
2 4054
3.4%
3 3847
3.2%
4 3760
3.2%
5 4310
3.6%
6 3819
3.2%
7 3658
3.1%
8 3919
3.3%
9 4090
3.4%
10 3569
3.0%
ValueCountFrequency (%)
31 2207
1.9%
30 3844
3.2%
29 3580
3.0%
28 3942
3.3%
27 3791
3.2%
26 4141
3.5%
25 4155
3.5%
24 3983
3.3%
23 3612
3.0%
22 3593
3.0%

stays_in_weekend_nights
Real number (ℝ)

ZEROS 

Distinct: 17
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.9270531
Minimum: 0
Maximum: 19
Zeros: 51895
Zeros (%): 43.5%
Negative: 0
Negative (%): 0.0%
Memory size: 5.9 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 1
Q3: 2
95-th percentile: 2
Maximum: 19
Range: 19
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 0.99511703
Coefficient of variation (CV): 1.0734197
Kurtosis: 6.3653972
Mean: 0.9270531
Median Absolute Deviation (MAD): 1
Skewness: 1.3202425
Sum: 110514
Variance: 0.9902579
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
0 51895
43.5%
2 33266
27.9%
1 30615
25.7%
4 1847
 
1.5%
3 1252
 
1.1%
6 152
 
0.1%
5 77
 
0.1%
8 58
 
< 0.1%
7 19
 
< 0.1%
9 10
 
< 0.1%
Other values (7) 19
 
< 0.1%
ValueCountFrequency (%)
0 51895
43.5%
1 30615
25.7%
2 33266
27.9%
3 1252
 
1.1%
4 1847
 
1.5%
5 77
 
0.1%
6 152
 
0.1%
7 19
 
< 0.1%
8 58
 
< 0.1%
9 10
 
< 0.1%
ValueCountFrequency (%)
19 1
 
< 0.1%
18 1
 
< 0.1%
16 2
 
< 0.1%
14 1
 
< 0.1%
13 2
 
< 0.1%
12 5
 
< 0.1%
10 7
 
< 0.1%
9 10
 
< 0.1%
8 58
< 0.1%
7 19
 
< 0.1%

stays_in_week_nights
Real number (ℝ)

ZEROS 

Distinct: 33
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.4991947
Minimum: 0
Maximum: 50
Zeros: 7572
Zeros (%): 6.4%
Negative: 0
Negative (%): 0.0%
Memory size: 5.9 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 2
Q3: 3
95-th percentile: 5
Maximum: 50
Range: 50
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.8971058
Coefficient of variation (CV): 0.75908683
Kurtosis: 22.250866
Mean: 2.4991947
Median Absolute Deviation (MAD): 1
Skewness: 2.7548629
Sum: 297929
Variance: 3.5990103
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
2 33670
28.2%
1 30292
25.4%
3 22241
18.7%
5 11068
 
9.3%
4 9543
 
8.0%
0 7572
 
6.4%
6 1494
 
1.3%
10 1030
 
0.9%
7 1024
 
0.9%
8 654
 
0.5%
Other values (23) 622
 
0.5%
ValueCountFrequency (%)
0 7572
 
6.4%
1 30292
25.4%
2 33670
28.2%
3 22241
18.7%
4 9543
 
8.0%
5 11068
 
9.3%
6 1494
 
1.3%
7 1024
 
0.9%
8 654
 
0.5%
9 228
 
0.2%
ValueCountFrequency (%)
50 1
 
< 0.1%
42 1
 
< 0.1%
40 2
 
< 0.1%
34 1
 
< 0.1%
33 1
 
< 0.1%
32 1
 
< 0.1%
30 4
< 0.1%
26 1
 
< 0.1%
25 6
< 0.1%
24 3
< 0.1%

adults
Real number (ℝ)

Distinct: 14
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.8592064
Minimum: 0
Maximum: 55
Zeros: 223
Zeros (%): 0.2%
Negative: 0
Negative (%): 0.0%
Memory size: 5.9 MiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 2
median: 2
Q3: 2
95-th percentile: 3
Maximum: 55
Range: 55
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.57518558
Coefficient of variation (CV): 0.30937155
Kurtosis: 1392.5063
Mean: 1.8592064
Median Absolute Deviation (MAD): 0
Skewness: 18.774333
Sum: 221636
Variance: 0.33083845
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
2 89680
75.2%
1 23027
 
19.3%
3 6202
 
5.2%
0 223
 
0.2%
4 62
 
0.1%
26 5
 
< 0.1%
27 2
 
< 0.1%
20 2
 
< 0.1%
5 2
 
< 0.1%
40 1
 
< 0.1%
Other values (4) 4
 
< 0.1%
ValueCountFrequency (%)
0 223
 
0.2%
1 23027
 
19.3%
2 89680
75.2%
3 6202
 
5.2%
4 62
 
0.1%
5 2
 
< 0.1%
6 1
 
< 0.1%
10 1
 
< 0.1%
20 2
 
< 0.1%
26 5
 
< 0.1%
ValueCountFrequency (%)
55 1
 
< 0.1%
50 1
 
< 0.1%
40 1
 
< 0.1%
27 2
 
< 0.1%
26 5
 
< 0.1%
20 2
 
< 0.1%
10 1
 
< 0.1%
6 1
 
< 0.1%
5 2
 
< 0.1%
4 62
0.1%

children
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 MiB
0.0: 110620
1.0: 4861
2.0: 3652
3.0: 76
10.0: 1

Length

Max length: 4
Median length: 3
Mean length: 3.0000084
Min length: 3

Characters and Unicode

Total characters: 357631
Distinct characters: 5
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 0.0
2nd row: 0.0
3rd row: 0.0
4th row: 0.0
5th row: 0.0

Common Values

ValueCountFrequency (%)
0.0 110620
92.8%
1.0 4861
 
4.1%
2.0 3652
 
3.1%
3.0 76
 
0.1%
10.0 1
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0.0 110620
92.8%
1.0 4861
 
4.1%
2.0 3652
 
3.1%
3.0 76
 
0.1%
10.0 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0 229831
64.3%
. 119210
33.3%
1 4862
 
1.4%
2 3652
 
1.0%
3 76
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 238421
66.7%
Other Punctuation 119210
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 229831
96.4%
1 4862
 
2.0%
2 3652
 
1.5%
3 76
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 119210
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 357631
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 229831
64.3%
. 119210
33.3%
1 4862
 
1.4%
2 3652
 
1.0%
3 76
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 357631
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 229831
64.3%
. 119210
33.3%
1 4862
 
1.4%
2 3652
 
1.0%
3 76
 
< 0.1%

babies
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.6 MiB
0: 118293
1: 900
2: 15
10: 1
9: 1

Length

Max length: 2
Median length: 1
Mean length: 1.0000084
Min length: 1

Characters and Unicode

Total characters: 119211
Distinct characters: 4
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

ValueCountFrequency (%)
0 118293
99.2%
1 900
 
0.8%
2 15
 
< 0.1%
10 1
 
< 0.1%
9 1
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 118293
99.2%
1 900
 
0.8%
2 15
 
< 0.1%
10 1
 
< 0.1%
9 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0 118294
99.2%
1 901
 
0.8%
2 15
 
< 0.1%
9 1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 119211
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 118294
99.2%
1 901
 
0.8%
2 15
 
< 0.1%
9 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 119211
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 118294
99.2%
1 901
 
0.8%
2 15
 
< 0.1%
9 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 119211
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 118294
99.2%
1 901
 
0.8%
2 15
 
< 0.1%
9 1
 
< 0.1%

meal
Categorical

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.7 MiB
BB: 92236
HB: 14458
SC: 11718
FB: 798

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Characters and Unicode

Total characters: 238420
Distinct characters: 5
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: BB
2nd row: BB
3rd row: BB
4th row: BB
5th row: BB

Common Values

ValueCountFrequency (%)
BB 92236
77.4%
HB 14458
 
12.1%
SC 11718
 
9.8%
FB 798
 
0.7%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
bb 92236
77.4%
hb 14458
 
12.1%
sc 11718
 
9.8%
fb 798
 
0.7%

Most occurring characters

ValueCountFrequency (%)
B 199728
83.8%
H 14458
 
6.1%
S 11718
 
4.9%
C 11718
 
4.9%
F 798
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 238420
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B 199728
83.8%
H 14458
 
6.1%
S 11718
 
4.9%
C 11718
 
4.9%
F 798
 
0.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 238420
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
B 199728
83.8%
H 14458
 
6.1%
S 11718
 
4.9%
C 11718
 
4.9%
F 798
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 238420
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
B 199728
83.8%
H 14458
 
6.1%
S 11718
 
4.9%
C 11718
 
4.9%
F 798
 
0.3%

country
Text

Distinct: 178
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 MiB

Length

Max length: 7
Median length: 3
Mean length: 3.00531
Min length: 2

Characters and Unicode

Total characters: 358263
Distinct characters: 30
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): < 0.1%

Sample

1st row: PRT
2nd row: PRT
3rd row: GBR
4th row: GBR
5th row: GBR
ValueCountFrequency (%)
prt 48483
40.7%
gbr 12120
 
10.2%
fra 10401
 
8.7%
esp 8560
 
7.2%
deu 7285
 
6.1%
ita 3761
 
3.2%
irl 3374
 
2.8%
bel 2342
 
2.0%
bra 2222
 
1.9%
nld 2103
 
1.8%
Other values (168) 18559
 
15.6%

Most occurring characters

ValueCountFrequency (%)
R 80668
22.5%
P 58389
16.3%
T 54151
15.1%
A 21602
 
6.0%
E 21520
 
6.0%
B 17040
 
4.8%
S 13911
 
3.9%
U 13762
 
3.8%
G 13120
 
3.7%
F 10941
 
3.1%
Other values (20) 53159
14.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 355395
99.2%
Lowercase Letter 2868
 
0.8%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
R 80668
22.7%
P 58389
16.4%
T 54151
15.2%
A 21602
 
6.1%
E 21520
 
6.1%
B 17040
 
4.8%
S 13911
 
3.9%
U 13762
 
3.9%
G 13120
 
3.7%
F 10941
 
3.1%
Other values (16) 50291
14.2%
Lowercase Letter
ValueCountFrequency (%)
n 1434
50.0%
k 478
 
16.7%
o 478
 
16.7%
w 478
 
16.7%

Most occurring scripts

ValueCountFrequency (%)
Latin 358263
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
R 80668
22.5%
P 58389
16.3%
T 54151
15.1%
A 21602
 
6.0%
E 21520
 
6.0%
B 17040
 
4.8%
S 13911
 
3.9%
U 13762
 
3.8%
G 13120
 
3.7%
F 10941
 
3.1%
Other values (20) 53159
14.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 358263
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
R 80668
22.5%
P 58389
16.3%
T 54151
15.1%
A 21602
 
6.0%
E 21520
 
6.0%
B 17040
 
4.8%
S 13911
 
3.9%
U 13762
 
3.8%
G 13120
 
3.7%
F 10941
 
3.1%
Other values (20) 53159
14.8%

market_segment
Categorical

Distinct: 8
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 11.5 MiB
Online TA: 56408
Offline TA/TO: 24182
Groups: 19791
Direct: 12582
Corporate: 5282
Other values (3): 965

Length

Max length: 13
Median length: 9
Mean length: 9.0191762
Min length: 6

Characters and Unicode

Total characters: 1075176
Distinct characters: 26
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Direct
2nd row: Direct
3rd row: Direct
4th row: Corporate
5th row: Online TA

Common Values

ValueCountFrequency (%)
Online TA 56408
47.3%
Offline TA/TO 24182
20.3%
Groups 19791
 
16.6%
Direct 12582
 
10.6%
Corporate 5282
 
4.4%
Complementary 728
 
0.6%
Aviation 235
 
0.2%
Undefined 2
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
online 56408
28.2%
ta 56408
28.2%
offline 24182
12.1%
ta/to 24182
12.1%
groups 19791
 
9.9%
direct 12582
 
6.3%
corporate 5282
 
2.6%
complementary 728
 
0.4%
aviation 235
 
0.1%
undefined 2
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
n 137965
12.8%
O 104772
9.7%
T 104772
9.7%
e 99914
9.3%
i 93644
8.7%
l 81318
7.6%
A 80825
7.5%
80590
7.5%
f 48366
 
4.5%
r 43665
 
4.1%
Other values (16) 199345
18.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 641650
59.7%
Uppercase Letter 328754
30.6%
Space Separator 80590
 
7.5%
Other Punctuation 24182
 
2.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 137965
21.5%
e 99914
15.6%
i 93644
14.6%
l 81318
12.7%
f 48366
 
7.5%
r 43665
 
6.8%
o 31318
 
4.9%
p 25801
 
4.0%
s 19791
 
3.1%
u 19791
 
3.1%
Other values (7) 40077
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
O 104772
31.9%
T 104772
31.9%
A 80825
24.6%
G 19791
 
6.0%
D 12582
 
3.8%
C 6010
 
1.8%
U 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
80590
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 24182
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 970404
90.3%
Common 104772
 
9.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 137965
14.2%
O 104772
10.8%
T 104772
10.8%
e 99914
10.3%
i 93644
9.7%
l 81318
8.4%
A 80825
8.3%
f 48366
 
5.0%
r 43665
 
4.5%
o 31318
 
3.2%
Other values (14) 143845
14.8%
Common
ValueCountFrequency (%)
80590
76.9%
/ 24182
 
23.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1075176
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 137965
12.8%
O 104772
9.7%
T 104772
9.7%
e 99914
9.3%
i 93644
8.7%
l 81318
7.6%
A 80825
7.5%
80590
7.5%
f 48366
 
4.5%
r 43665
 
4.1%
Other values (16) 199345
18.5%

distribution_channel
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 11.1 MiB
TA/TO: 97750
Direct: 14611
Corporate: 6651
GDS: 193
Undefined: 5

Length

Max length: 9
Median length: 5
Mean length: 5.3426642
Min length: 3

Characters and Unicode

Total characters: 636899
Distinct characters: 20
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Direct
2nd row: Direct
3rd row: Direct
4th row: Corporate
5th row: TA/TO

Common Values

ValueCountFrequency (%)
TA/TO 97750
82.0%
Direct 14611
 
12.3%
Corporate 6651
 
5.6%
GDS 193
 
0.2%
Undefined 5
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
ta/to 97750
82.0%
direct 14611
 
12.3%
corporate 6651
 
5.6%
gds 193
 
0.2%
undefined 5
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
T 195500
30.7%
/ 97750
15.3%
O 97750
15.3%
A 97750
15.3%
r 27913
 
4.4%
e 21272
 
3.3%
t 21262
 
3.3%
D 14804
 
2.3%
i 14616
 
2.3%
c 14611
 
2.3%
Other values (10) 33671
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 412846
64.8%
Lowercase Letter 126303
 
19.8%
Other Punctuation 97750
 
15.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 27913
22.1%
e 21272
16.8%
t 21262
16.8%
i 14616
11.6%
c 14611
11.6%
o 13302
10.5%
a 6651
 
5.3%
p 6651
 
5.3%
n 10
 
< 0.1%
d 10
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
T 195500
47.4%
O 97750
23.7%
A 97750
23.7%
D 14804
 
3.6%
C 6651
 
1.6%
G 193
 
< 0.1%
S 193
 
< 0.1%
U 5
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
/ 97750
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 539149
84.7%
Common 97750
 
15.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 195500
36.3%
O 97750
18.1%
A 97750
18.1%
r 27913
 
5.2%
e 21272
 
3.9%
t 21262
 
3.9%
D 14804
 
2.7%
i 14616
 
2.7%
c 14611
 
2.7%
o 13302
 
2.5%
Other values (9) 20369
 
3.8%
Common
ValueCountFrequency (%)
/ 97750
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 636899
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
T 195500
30.7%
/ 97750
15.3%
O 97750
15.3%
A 97750
15.3%
r 27913
 
4.4%
e 21272
 
3.3%
t 21262
 
3.3%
D 14804
 
2.3%
i 14616
 
2.3%
c 14611
 
2.3%
Other values (10) 33671
 
5.3%

is_repeated_guest
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.6 MiB
0: 115455
1: 3755

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 119210
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

ValueCountFrequency (%)
0 115455
96.9%
1 3755
 
3.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 115455
96.9%
1 3755
 
3.1%

Most occurring characters

ValueCountFrequency (%)
0 115455
96.9%
1 3755
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 119210
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 115455
96.9%
1 3755
 
3.1%

Most occurring scripts

ValueCountFrequency (%)
Common 119210
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 115455
96.9%
1 3755
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 119210
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 115455
96.9%
1 3755
 
3.1%

previous_cancellations
Real number (ℝ)

SKEWED  ZEROS 

Distinct15
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.087190672
Minimum0
Maximum26
Zeros112731
Zeros (%)94.6%
Negative0
Negative (%)0.0%
Memory size5.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum26
Range26
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.84491826
Coefficient of variation (CV)9.6904663
Kurtosis673.22115
Mean0.087190672
Median Absolute Deviation (MAD)0
Skewness24.443924
Sum10394
Variance0.71388687
MonotonicityNot monotonic
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0 112731
94.6%
1 6048
 
5.1%
2 114
 
0.1%
3 65
 
0.1%
24 48
 
< 0.1%
11 35
 
< 0.1%
4 31
 
< 0.1%
26 26
 
< 0.1%
25 25
 
< 0.1%
6 22
 
< 0.1%
Other values (5) 65
 
0.1%
ValueCountFrequency (%)
0 112731
94.6%
1 6048
 
5.1%
2 114
 
0.1%
3 65
 
0.1%
4 31
 
< 0.1%
5 19
 
< 0.1%
6 22
 
< 0.1%
11 35
 
< 0.1%
13 12
 
< 0.1%
14 14
 
< 0.1%
ValueCountFrequency (%)
26 26
< 0.1%
25 25
< 0.1%
24 48
< 0.1%
21 1
 
< 0.1%
19 19
 
< 0.1%
14 14
 
< 0.1%
13 12
 
< 0.1%
11 35
< 0.1%
6 22
< 0.1%
5 19
 
< 0.1%
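
The summary statistics for previous_cancellations can be cross-checked against the frequency tables, because the minimum-10 and maximum-10 listings together cover all 15 distinct values. The sketch below is a check under stated assumptions, not the report's own code. It assumes the reported variance is the sample variance (n - 1 denominator). It also uses the plain population skewness, which may differ slightly from the estimator behind the reported value.

```python
import math

# Value counts for previous_cancellations, read from the tables above.
value_counts = {
    0: 112731, 1: 6048, 2: 114, 3: 65, 4: 31, 5: 19, 6: 22, 11: 35,
    13: 12, 14: 14, 19: 19, 21: 1, 24: 48, 25: 25, 26: 26,
}

n = sum(value_counts.values())                          # number of observations
total = sum(v * c for v, c in value_counts.items())    # the "Sum" statistic
mean = total / n

# Sample variance (assumed n - 1 denominator), then std and CV = std / mean.
variance = sum(c * (v - mean) ** 2 for v, c in value_counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean

# Population skewness: third central moment over the 1.5 power of the second.
m2 = sum(c * (v - mean) ** 2 for v, c in value_counts.items()) / n
m3 = sum(c * (v - mean) ** 3 for v, c in value_counts.items()) / n
skewness = m3 / m2 ** 1.5

zeros_pct = 100 * value_counts[0] / n
print(n, total, round(mean, 6), round(variance, 4), round(cv, 3),
      round(skewness, 2), round(zeros_pct, 1))
```

The counts reproduce the reported observation total (119210) and Sum (10394), and the derived mean, variance, and coefficient of variation agree with the report to within rounding.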

previous_bookings_not_canceled
Real number (ℝ)

SKEWED  ZEROS 

Distinct73
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1370942
Minimum0
Maximum72
Zeros115597
Zeros (%)97.0%
Negative0
Negative (%)0.0%
Memory size5.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum72
Range72
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.4981372
Coefficient of variation (CV)10.927794
Kurtosis766.95283
Mean0.1370942
Median Absolute Deviation (MAD)0
Skewness23.539555
Sum16343
Variance2.244415
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 115597
97.0%
1 1538
 
1.3%
2 580
 
0.5%
3 333
 
0.3%
4 229
 
0.2%
5 181
 
0.2%
6 113
 
0.1%
7 88
 
0.1%
8 70
 
0.1%
9 59
 
< 0.1%
Other values (63) 422
 
0.4%
ValueCountFrequency (%)
0 115597
97.0%
1 1538
 
1.3%
2 580
 
0.5%
3 333
 
0.3%
4 229
 
0.2%
5 181
 
0.2%
6 113
 
0.1%
7 88
 
0.1%
8 70
 
0.1%
9 59
 
< 0.1%
ValueCountFrequency (%)
72 1
< 0.1%
71 1
< 0.1%
70 1
< 0.1%
69 1
< 0.1%
68 1
< 0.1%
67 1
< 0.1%
66 1
< 0.1%
65 1
< 0.1%
64 1
< 0.1%
63 1
< 0.1%

reserved_room_type
Categorical

IMBALANCE 

Distinct9
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size10.6 MiB
A
85873 
D
19179 
E
 
6519
F
 
2894
G
 
2092
Other values (4)
 
2653

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters119210
Distinct characters9
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowC
2nd rowC
3rd rowA
4th rowA
5th rowA

Common Values

ValueCountFrequency (%)
A 85873
72.0%
D 19179
 
16.1%
E 6519
 
5.5%
F 2894
 
2.4%
G 2092
 
1.8%
B 1115
 
0.9%
C 931
 
0.8%
H 601
 
0.5%
L 6
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
a 85873
72.0%
d 19179
 
16.1%
e 6519
 
5.5%
f 2894
 
2.4%
g 2092
 
1.8%
b 1115
 
0.9%
c 931
 
0.8%
h 601
 
0.5%
l 6
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
A 85873
72.0%
D 19179
 
16.1%
E 6519
 
5.5%
F 2894
 
2.4%
G 2092
 
1.8%
B 1115
 
0.9%
C 931
 
0.8%
H 601
 
0.5%
L 6
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 119210
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 85873
72.0%
D 19179
 
16.1%
E 6519
 
5.5%
F 2894
 
2.4%
G 2092
 
1.8%
B 1115
 
0.9%
C 931
 
0.8%
H 601
 
0.5%
L 6
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 119210
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 85873
72.0%
D 19179
 
16.1%
E 6519
 
5.5%
F 2894
 
2.4%
G 2092
 
1.8%
B 1115
 
0.9%
C 931
 
0.8%
H 601
 
0.5%
L 6
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 119210
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 85873
72.0%
D 19179
 
16.1%
E 6519
 
5.5%
F 2894
 
2.4%
G 2092
 
1.8%
B 1115
 
0.9%
C 931
 
0.8%
H 601
 
0.5%
L 6
 
< 0.1%

assigned_room_type
Categorical

Distinct11
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size10.6 MiB
A
74020 
D
25309 
E
7798 
F
 
3751
G
 
2549
Other values (6)
 
5783

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters119210
Distinct characters11
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1
Unique (%)< 0.1%

Sample

1st rowC
2nd rowC
3rd rowC
4th rowA
5th rowA

Common Values

ValueCountFrequency (%)
A 74020
62.1%
D 25309
 
21.2%
E 7798
 
6.5%
F 3751
 
3.1%
G 2549
 
2.1%
C 2370
 
2.0%
B 2154
 
1.8%
H 712
 
0.6%
I 359
 
0.3%
K 187
 
0.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
a 74020
62.1%
d 25309
 
21.2%
e 7798
 
6.5%
f 3751
 
3.1%
g 2549
 
2.1%
c 2370
 
2.0%
b 2154
 
1.8%
h 712
 
0.6%
i 359
 
0.3%
k 187
 
0.2%

Most occurring characters

ValueCountFrequency (%)
A 74020
62.1%
D 25309
 
21.2%
E 7798
 
6.5%
F 3751
 
3.1%
G 2549
 
2.1%
C 2370
 
2.0%
B 2154
 
1.8%
H 712
 
0.6%
I 359
 
0.3%
K 187
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 119210
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 74020
62.1%
D 25309
 
21.2%
E 7798
 
6.5%
F 3751
 
3.1%
G 2549
 
2.1%
C 2370
 
2.0%
B 2154
 
1.8%
H 712
 
0.6%
I 359
 
0.3%
K 187
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 119210
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 74020
62.1%
D 25309
 
21.2%
E 7798
 
6.5%
F 3751
 
3.1%
G 2549
 
2.1%
C 2370
 
2.0%
B 2154
 
1.8%
H 712
 
0.6%
I 359
 
0.3%
K 187
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 119210
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 74020
62.1%
D 25309
 
21.2%
E 7798
 
6.5%
F 3751
 
3.1%
G 2549
 
2.1%
C 2370
 
2.0%
B 2154
 
1.8%
H 712
 
0.6%
I 359
 
0.3%
K 187
 
0.2%

booking_changes
Real number (ℝ)

ZEROS 

Distinct19
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.21879876
Minimum0
Maximum18
Zeros101232
Zeros (%)84.9%
Negative0
Negative (%)0.0%
Memory size5.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum18
Range18
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.63850446
Coefficient of variation (CV)2.9182271
Kurtosis63.437992
Mean0.21879876
Median Absolute Deviation (MAD)0
Skewness5.5000578
Sum26083
Variance0.40768794
MonotonicityNot monotonic
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
0 101232
84.9%
1 12666
 
10.6%
2 3780
 
3.2%
3 914
 
0.8%
4 367
 
0.3%
5 115
 
0.1%
6 61
 
0.1%
7 29
 
< 0.1%
8 14
 
< 0.1%
9 8
 
< 0.1%
Other values (9) 24
 
< 0.1%
ValueCountFrequency (%)
0 101232
84.9%
1 12666
 
10.6%
2 3780
 
3.2%
3 914
 
0.8%
4 367
 
0.3%
5 115
 
0.1%
6 61
 
0.1%
7 29
 
< 0.1%
8 14
 
< 0.1%
9 8
 
< 0.1%
ValueCountFrequency (%)
18 1
 
< 0.1%
17 2
 
< 0.1%
16 2
 
< 0.1%
15 3
 
< 0.1%
14 3
 
< 0.1%
13 5
< 0.1%
12 1
 
< 0.1%
11 1
 
< 0.1%
10 6
< 0.1%
9 8
< 0.1%

deposit_type
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size11.6 MiB
No Deposit
104461 
Non Refund
14587 
Refundable
 
162

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters1192100
Distinct characters17
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowNo Deposit
2nd rowNo Deposit
3rd rowNo Deposit
4th rowNo Deposit
5th rowNo Deposit

Common Values

ValueCountFrequency (%)
No Deposit 104461
87.6%
Non Refund 14587
 
12.2%
Refundable 162
 
0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
no 104461
43.8%
deposit 104461
43.8%
non 14587
 
6.1%
refund 14587
 
6.1%
refundable 162
 
0.1%

Most occurring characters

ValueCountFrequency (%)
o 223509
18.7%
e 119372
10.0%
N 119048
10.0%
(space) 119048
10.0%
s 104461
8.8%
i 104461
8.8%
t 104461
8.8%
p 104461
8.8%
D 104461
8.8%
n 29336
 
2.5%
Other values (7) 59482
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 834794
70.0%
Uppercase Letter 238258
 
20.0%
Space Separator 119048
 
10.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 223509
26.8%
e 119372
14.3%
s 104461
12.5%
i 104461
12.5%
t 104461
12.5%
p 104461
12.5%
n 29336
 
3.5%
f 14749
 
1.8%
u 14749
 
1.8%
d 14749
 
1.8%
Other values (3) 486
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
N 119048
50.0%
D 104461
43.8%
R 14749
 
6.2%
Space Separator
ValueCountFrequency (%)
(space) 119048
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1073052
90.0%
Common 119048
 
10.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 223509
20.8%
e 119372
11.1%
N 119048
11.1%
s 104461
9.7%
i 104461
9.7%
t 104461
9.7%
p 104461
9.7%
D 104461
9.7%
n 29336
 
2.7%
R 14749
 
1.4%
Other values (6) 44733
 
4.2%
Common
ValueCountFrequency (%)
(space) 119048
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1192100
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 223509
18.7%
e 119372
10.0%
N 119048
10.0%
(space) 119048
10.0%
s 104461
8.8%
i 104461
8.8%
t 104461
8.8%
p 104461
8.8%
D 104461
8.8%
n 29336
 
2.5%
Other values (7) 59482
 
5.0%

agent
Real number (ℝ)

ZEROS 

Distinct334
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.889078
Minimum0
Maximum535
Zeros16280
Zeros (%)13.7%
Negative0
Negative (%)0.0%
Memory size5.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q17
median9
Q3152
95-th percentile250
Maximum535
Range535
Interquartile range (IQR)145

Descriptive statistics

Standard deviation107.16888
Coefficient of variation (CV)1.4310349
Kurtosis0.50744609
Mean74.889078
Median Absolute Deviation (MAD)9
Skewness1.2985454
Sum8927527
Variance11485.17
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9 31922
26.8%
0 16280
13.7%
240 13922
11.7%
1 7187
 
6.0%
14 3633
 
3.0%
7 3532
 
3.0%
6 3290
 
2.8%
250 2870
 
2.4%
241 1721
 
1.4%
28 1657
 
1.4%
Other values (324) 33196
27.8%
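
The "Other values (324)" row is the remainder after the ten listed values are subtracted from the column totals. The sketch below shows that arithmetic for agent; `other_values` is a hypothetical helper name introduced only for illustration.

```python
def other_values(n_rows, n_distinct, top_counts):
    """Return (distinct values, row count) not covered by the listed top values."""
    return n_distinct - len(top_counts), n_rows - sum(top_counts)

# Ten most common agent values and their counts, from the table above.
top_agent = {9: 31922, 0: 16280, 240: 13922, 1: 7187, 14: 3633,
             7: 3532, 6: 3290, 250: 2870, 241: 1721, 28: 1657}

# 119210 observations and 334 distinct values, as reported for this variable.
n_other, rows_other = other_values(119210, 334, list(top_agent.values()))
print(n_other, rows_other, f"{rows_other / 119210:.1%}")  # 324 33196 27.8%
```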
ValueCountFrequency (%)
0 16280
13.7%
1 7187
 
6.0%
2 162
 
0.1%
3 1336
 
1.1%
4 47
 
< 0.1%
5 330
 
0.3%
6 3290
 
2.8%
7 3532
 
3.0%
8 1514
 
1.3%
9 31922
26.8%
ValueCountFrequency (%)
535 3
 
< 0.1%
531 68
0.1%
527 35
< 0.1%
526 10
 
< 0.1%
510 2
 
< 0.1%
509 10
 
< 0.1%
508 6
 
< 0.1%
502 24
 
< 0.1%
497 1
 
< 0.1%
495 57
< 0.1%

company
Real number (ℝ)

ZEROS 

Distinct349
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.7354
Minimum0
Maximum543
Zeros112442
Zeros (%)94.3%
Negative0
Negative (%)0.0%
Memory size5.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile40
Maximum543
Range543
Interquartile range (IQR)0

Descriptive statistics

Standard deviation53.830143
Coefficient of variation (CV)5.0142654
Kurtosis37.898876
Mean10.7354
Median Absolute Deviation (MAD)0
Skewness5.9292756
Sum1279767
Variance2897.6843
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 112442
94.3%
40 924
 
0.8%
223 784
 
0.7%
67 267
 
0.2%
45 249
 
0.2%
153 213
 
0.2%
174 147
 
0.1%
219 141
 
0.1%
281 138
 
0.1%
154 133
 
0.1%
Other values (339) 3772
 
3.2%
ValueCountFrequency (%)
0 112442
94.3%
6 1
 
< 0.1%
8 1
 
< 0.1%
9 37
 
< 0.1%
10 1
 
< 0.1%
11 1
 
< 0.1%
12 14
 
< 0.1%
14 9
 
< 0.1%
16 5
 
< 0.1%
18 1
 
< 0.1%
ValueCountFrequency (%)
543 2
 
< 0.1%
541 1
 
< 0.1%
539 2
 
< 0.1%
534 2
 
< 0.1%
531 1
 
< 0.1%
530 5
 
< 0.1%
528 2
 
< 0.1%
525 15
< 0.1%
523 17
< 0.1%
521 7
< 0.1%

days_in_waiting_list
Real number (ℝ)

ZEROS 

Distinct127
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.3212147
Minimum0
Maximum391
Zeros115517
Zeros (%)96.9%
Negative0
Negative (%)0.0%
Memory size5.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum391
Range391
Interquartile range (IQR)0

Descriptive statistics

Standard deviation17.598002
Coefficient of variation (CV)7.5813763
Kurtosis186.89459
Mean2.3212147
Median Absolute Deviation (MAD)0
Skewness11.948868
Sum276712
Variance309.68967
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 115517
96.9%
39 227
 
0.2%
58 164
 
0.1%
44 141
 
0.1%
31 127
 
0.1%
35 96
 
0.1%
46 94
 
0.1%
69 89
 
0.1%
63 83
 
0.1%
87 80
 
0.1%
Other values (117) 2592
 
2.2%
ValueCountFrequency (%)
0 115517
96.9%
1 12
 
< 0.1%
2 5
 
< 0.1%
3 59
 
< 0.1%
4 25
 
< 0.1%
5 8
 
< 0.1%
6 15
 
< 0.1%
7 4
 
< 0.1%
8 7
 
< 0.1%
9 16
 
< 0.1%
ValueCountFrequency (%)
391 45
< 0.1%
379 15
 
< 0.1%
330 15
 
< 0.1%
259 10
 
< 0.1%
236 35
< 0.1%
224 10
 
< 0.1%
223 61
0.1%
215 21
 
< 0.1%
207 15
 
< 0.1%
193 1
 
< 0.1%

customer_type
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size11.7 MiB
Transient
89476 
Transient-Party
25088 
Contract
 
4072
Group
 
574

Length

Max length15
Median length9
Mean length10.209295
Min length5

Characters and Unicode

Total characters1217050
Distinct characters17
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
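
The length and character totals for customer_type follow directly from the four category labels and their counts. The sketch below recomputes them; the variable names are illustrative only.

```python
# Category counts for customer_type, from the overview above.
counts = {"Transient": 89476, "Transient-Party": 25088, "Contract": 4072, "Group": 574}

n = sum(counts.values())
# Each row contributes the length of its label to the character total.
total_characters = sum(len(label) * c for label, c in counts.items())
mean_length = total_characters / n
lengths = [len(label) for label in counts]
# Distinct characters across all labels (case-sensitive, hyphen included).
distinct_characters = len(set("".join(counts)))

print(total_characters, round(mean_length, 6), min(lengths), max(lengths),
      distinct_characters)
```

The results agree with the reported Total characters (1217050), Mean length (10.209295), Min and Max length (5 and 15), and Distinct characters (17).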

Unique

Unique0
Unique (%)0.0%

Sample

1st rowTransient
2nd rowTransient
3rd rowTransient
4th rowTransient
5th rowTransient

Common Values

ValueCountFrequency (%)
Transient 89476
75.1%
Transient-Party 25088
 
21.0%
Contract 4072
 
3.4%
Group 574
 
0.5%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
transient 89476
75.1%
transient-party 25088
 
21.0%
contract 4072
 
3.4%
group 574
 
0.5%

Most occurring characters

ValueCountFrequency (%)
n 233200
19.2%
t 147796
12.1%
r 144298
11.9%
a 143724
11.8%
T 114564
9.4%
s 114564
9.4%
i 114564
9.4%
e 114564
9.4%
y 25088
 
2.1%
- 25088
 
2.1%
Other values (7) 39600
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1047664
86.1%
Uppercase Letter 144298
 
11.9%
Dash Punctuation 25088
 
2.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 233200
22.3%
t 147796
14.1%
r 144298
13.8%
a 143724
13.7%
s 114564
10.9%
i 114564
10.9%
e 114564
10.9%
y 25088
 
2.4%
o 4646
 
0.4%
c 4072
 
0.4%
Other values (2) 1148
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
T 114564
79.4%
P 25088
 
17.4%
C 4072
 
2.8%
G 574
 
0.4%
Dash Punctuation
ValueCountFrequency (%)
- 25088
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1191962
97.9%
Common 25088
 
2.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 233200
19.6%
t 147796
12.4%
r 144298
12.1%
a 143724
12.1%
T 114564
9.6%
s 114564
9.6%
i 114564
9.6%
e 114564
9.6%
y 25088
 
2.1%
P 25088
 
2.1%
Other values (6) 14512
 
1.2%
Common
ValueCountFrequency (%)
- 25088
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1217050
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 233200
19.2%
t 147796
12.1%
r 144298
11.9%
a 143724
11.8%
T 114564
9.4%
s 114564
9.4%
i 114564
9.4%
e 114564
9.4%
y 25088
 
2.1%
- 25088
 
2.1%
Other values (7) 39600
 
3.3%

adr
Real number (ℝ)

ZEROS 

Distinct8866
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean101.96909
Minimum-6.38
Maximum5400
Zeros1810
Zeros (%)1.5%
Negative1
Negative (%)< 0.1%
Memory size5.9 MiB

Quantile statistics

Minimum-6.38
5-th percentile39
Q169.5
median94.95
Q3126
95-th percentile193.5
Maximum5400
Range5406.38
Interquartile range (IQR)56.5
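
One way to read these quantiles is to compare the extremes against Tukey's 1.5 × IQR fences. This rule is not part of the report; it is added here only as a sketch of how the reported quartiles could be used to flag the extreme adr values.

```python
# Quartiles of adr as reported above.
q1, q3 = 69.5, 126.0
iqr = q3 - q1                      # 56.5, matching the reported IQR

# Tukey fences: values beyond these are conventionally treated as outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

reported_min, reported_max = -6.38, 5400.0
print(lower_fence, upper_fence)                                 # -15.25 210.75
print(reported_min < lower_fence, reported_max > upper_fence)   # False True
```

Under this rule the maximum of 5400 lies far above the upper fence. The single negative value (-6.38) stays inside the lower fence and would need a domain check instead, since a negative daily rate is unexpected.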

Descriptive statistics

Standard deviation50.434007
Coefficient of variation (CV)0.49460092
Kurtosis1022.8267
Mean101.96909
Median Absolute Deviation (MAD)27.95
Skewness10.612728
Sum12155735
Variance2543.589
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
62 3754
 
3.1%
75 2715
 
2.3%
90 2472
 
2.1%
65 2418
 
2.0%
80 1889
 
1.6%
0 1810
 
1.5%
95 1661
 
1.4%
120 1607
 
1.3%
100 1573
 
1.3%
85 1538
 
1.3%
Other values (8856) 97773
82.0%
ValueCountFrequency (%)
-6.38 1
 
< 0.1%
0 1810
1.5%
0.26 1
 
< 0.1%
0.5 1
 
< 0.1%
1 14
 
< 0.1%
1.48 1
 
< 0.1%
1.56 2
 
< 0.1%
1.6 1
 
< 0.1%
1.8 1
 
< 0.1%
2 12
 
< 0.1%
ValueCountFrequency (%)
5400 1
< 0.1%
510 1
< 0.1%
508 1
< 0.1%
451.5 1
< 0.1%
450 1
< 0.1%
437 1
< 0.1%
426.25 1
< 0.1%
402 1
< 0.1%
397.38 1
< 0.1%
392 2
< 0.1%

required_car_parking_spaces
Categorical

IMBALANCE 

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size10.6 MiB
0
111801 
1
 
7376
2
 
28
3
 
3
8
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters119210
Distinct characters5
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 111801
93.8%
1 7376
 
6.2%
2 28
 
< 0.1%
3 3
 
< 0.1%
8 2
 
< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 111801
93.8%
1 7376
 
6.2%
2 28
 
< 0.1%
3 3
 
< 0.1%
8 2
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
0 111801
93.8%
1 7376
 
6.2%
2 28
 
< 0.1%
3 3
 
< 0.1%
8 2
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 119210
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 111801
93.8%
1 7376
 
6.2%
2 28
 
< 0.1%
3 3
 
< 0.1%
8 2
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 119210
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 111801
93.8%
1 7376
 
6.2%
2 28
 
< 0.1%
3 3
 
< 0.1%
8 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 119210
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 111801
93.8%
1 7376
 
6.2%
2 28
 
< 0.1%
3 3
 
< 0.1%
8 2
 
< 0.1%

total_of_special_requests
Real number (ℝ)

ZEROS 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.57150407
Minimum0
Maximum5
Zeros70201
Zeros (%)58.9%
Negative0
Negative (%)0.0%
Memory size5.9 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile2
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.7928759
Coefficient of variation (CV)1.3873495
Kurtosis1.4926142
Mean0.57150407
Median Absolute Deviation (MAD)0
Skewness1.3490487
Sum68129
Variance0.62865219
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 70201
58.9%
1 33183
27.8%
2 12952
 
10.9%
3 2494
 
2.1%
4 340
 
0.3%
5 40
 
< 0.1%
ValueCountFrequency (%)
0 70201
58.9%
1 33183
27.8%
2 12952
 
10.9%
3 2494
 
2.1%
4 340
 
0.3%
5 40
 
< 0.1%
ValueCountFrequency (%)
5 40
 
< 0.1%
4 340
 
0.3%
3 2494
 
2.1%
2 12952
 
10.9%
1 33183
27.8%
0 70201
58.9%

reservation_status
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size11.5 MiB
Check-Out
75011 
Canceled
42993 
No-Show
 
1206

Length

Max length9
Median length9
Mean length8.6191175
Min length7

Characters and Unicode

Total characters1027485
Distinct characters17
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowCheck-Out
2nd rowCheck-Out
3rd rowCheck-Out
4th rowCheck-Out
5th rowCheck-Out

Common Values

ValueCountFrequency (%)
Check-Out 75011
62.9%
Canceled 42993
36.1%
No-Show 1206
 
1.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
check-out 75011
62.9%
canceled 42993
36.1%
no-show 1206
 
1.0%

Most occurring characters

ValueCountFrequency (%)
e 160997
15.7%
C 118004
11.5%
c 118004
11.5%
h 76217
7.4%
- 76217
7.4%
u 75011
7.3%
t 75011
7.3%
O 75011
7.3%
k 75011
7.3%
a 42993
 
4.2%
Other values (7) 135009
13.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 755841
73.6%
Uppercase Letter 195427
 
19.0%
Dash Punctuation 76217
 
7.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 160997
21.3%
c 118004
15.6%
h 76217
10.1%
u 75011
9.9%
t 75011
9.9%
k 75011
9.9%
a 42993
 
5.7%
n 42993
 
5.7%
l 42993
 
5.7%
d 42993
 
5.7%
Other values (2) 3618
 
0.5%
Uppercase Letter
ValueCountFrequency (%)
C 118004
60.4%
O 75011
38.4%
N 1206
 
0.6%
S 1206
 
0.6%
Dash Punctuation
ValueCountFrequency (%)
- 76217
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 951268
92.6%
Common 76217
 
7.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 160997
16.9%
C 118004
12.4%
c 118004
12.4%
h 76217
8.0%
u 75011
7.9%
t 75011
7.9%
O 75011
7.9%
k 75011
7.9%
a 42993
 
4.5%
n 42993
 
4.5%
Other values (6) 92016
9.7%
Common
ValueCountFrequency (%)
- 76217
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1027485
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 160997
15.7%
C 118004
11.5%
c 118004
11.5%
h 76217
7.4%
- 76217
7.4%
u 75011
7.3%
t 75011
7.3%
O 75011
7.3%
k 75011
7.3%
a 42993
 
4.2%
Other values (7) 135009
13.1%

reservation_status_date
DateTime

Distinct926
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size5.9 MiB
Minimum2014-10-17 00:00:00
Maximum2017-09-14 00:00:00
Histogram with fixed size bins (bins=50)
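
The reported minimum, maximum, and distinct count of this date variable give a rough sense of how densely the date range is covered. The sketch below is illustrative arithmetic on those three reported figures, not output of the profiling tool.

```python
from datetime import date

# Reported minimum and maximum of the date variable.
first, last = date(2014, 10, 17), date(2017, 9, 14)
calendar_days = (last - first).days + 1   # inclusive of both endpoints

distinct_dates = 926                      # reported distinct count
coverage = distinct_dates / calendar_days
print(calendar_days, f"{coverage:.1%}")   # 1064 87.0%
```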

Interactions

[Interaction plots not preserved in this text export]
2024-05-29T15:33:26.106345image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:27.666138image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:29.084634image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:30.396604image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:31.692384image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:33.392488image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:35.369418image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:36.928737image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:18.009730image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:19.384040image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:20.741807image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:22.266364image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:23.566193image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:24.930903image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:26.201707image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:27.766517image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:29.179276image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:30.491041image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:31.813212image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:33.562604image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:35.523479image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:37.021547image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:18.115593image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:19.473916image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:20.841700image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:22.364671image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:23.651924image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:25.013588image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:26.293872image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:27.862561image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:29.272001image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:30.576233image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:31.947953image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:34.098504image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:35.630956image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:37.121923image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:18.211544image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:19.578112image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:20.931303image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:22.455888image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:23.742234image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:25.100543image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:26.378112image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:27.956110image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:29.375137image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:30.664291image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:32.050836image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:34.213006image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:35.720594image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:37.233584image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:18.321224image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:19.684541image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:21.032478image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:22.555896image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:23.858038image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:25.198789image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:26.475369image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:28.065529image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:29.472325image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:30.769272image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:32.154308image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:34.330438image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:35.825249image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:37.335106image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:18.414575image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:19.785999image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:21.123712image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:22.646913image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:23.960118image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:25.285420image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:26.557943image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:28.156027image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:29.563481image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:30.854748image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:32.247236image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:34.442155image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-05-29T15:33:35.916797image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
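
The per-column nullity summarized above can be approximated directly with pandas. The following is a minimal sketch, not the report's own implementation; the helper name `nullity_by_column` and the toy DataFrame are assumptions made purely for illustration (the profiled dataset itself reports 0 missing cells).

```python
import pandas as pd


def nullity_by_column(df: pd.DataFrame) -> pd.DataFrame:
    """Return the count and percentage of missing cells for each column."""
    missing = df.isna().sum()
    return pd.DataFrame({
        "missing": missing,
        "missing_pct": 100.0 * missing / len(df),
    })


# Hypothetical toy data with deliberate gaps, for illustration only.
toy = pd.DataFrame({
    "hotel": ["City Hotel", "Resort Hotel", None],
    "adr": [100.0, None, None],
})
summary = nullity_by_column(toy)
print(summary)
```

Applied to the full dataset, every row of such a summary would be expected to show zero missing cells, in line with the overview statistics.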

Sample

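(index) | hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date
0 | Resort Hotel | 0 | 342 | 2015 | July | 27 | 1 | 0 | 0 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 3 | No Deposit | 0.0 | 0.0 | 0 | Transient | 0.0 | 0 | 0 | Check-Out | 2015-07-01
1 | Resort Hotel | 0 | 737 | 2015 | July | 27 | 1 | 0 | 0 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 4 | No Deposit | 0.0 | 0.0 | 0 | Transient | 0.0 | 0 | 0 | Check-Out | 2015-07-01
2 | Resort Hotel | 0 | 7 | 2015 | July | 27 | 1 | 0 | 1 | 1 | 0.0 | 0 | BB | GBR | Direct | Direct | 0 | 0 | 0 | A | C | 0 | No Deposit | 0.0 | 0.0 | 0 | Transient | 75.0 | 0 | 0 | Check-Out | 2015-07-02
3 | Resort Hotel | 0 | 13 | 2015 | July | 27 | 1 | 0 | 1 | 1 | 0.0 | 0 | BB | GBR | Corporate | Corporate | 0 | 0 | 0 | A | A | 0 | No Deposit | 304.0 | 0.0 | 0 | Transient | 75.0 | 0 | 0 | Check-Out | 2015-07-02
4 | Resort Hotel | 0 | 14 | 2015 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | 0.0 | 0 | Transient | 98.0 | 0 | 1 | Check-Out | 2015-07-03
5 | Resort Hotel | 0 | 14 | 2015 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | 0.0 | 0 | Transient | 98.0 | 0 | 1 | Check-Out | 2015-07-03
6 | Resort Hotel | 0 | 0 | 2015 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 0 | No Deposit | 0.0 | 0.0 | 0 | Transient | 107.0 | 0 | 0 | Check-Out | 2015-07-03
7 | Resort Hotel | 0 | 9 | 2015 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | FB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 0 | No Deposit | 303.0 | 0.0 | 0 | Transient | 103.0 | 0 | 1 | Check-Out | 2015-07-03
8 | Resort Hotel | 1 | 85 | 2015 | July | 27 | 1 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | 0.0 | 0 | Transient | 82.0 | 0 | 1 | Canceled | 2015-05-06
9 | Resort Hotel | 1 | 75 | 2015 | July | 27 | 1 | 0 | 3 | 2 | 0.0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | D | D | 0 | No Deposit | 15.0 | 0.0 | 0 | Transient | 105.5 | 0 | 0 | Canceled | 2015-04-22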
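
(index) | hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date
119380 | City Hotel | 0 | 44 | 2017 | August | 35 | 31 | 1 | 3 | 2 | 0.0 | 0 | SC | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | 0.0 | 0 | Transient | 140.75 | 0 | 1 | Check-Out | 2017-09-04
119381 | City Hotel | 0 | 188 | 2017 | August | 35 | 31 | 2 | 3 | 2 | 0.0 | 0 | BB | DEU | Direct | Direct | 0 | 0 | 0 | A | A | 0 | No Deposit | 14.0 | 0.0 | 0 | Transient | 99.00 | 0 | 0 | Check-Out | 2017-09-05
119382 | City Hotel | 0 | 135 | 2017 | August | 35 | 30 | 2 | 4 | 3 | 0.0 | 0 | BB | JPN | Online TA | TA/TO | 0 | 0 | 0 | G | G | 0 | No Deposit | 7.0 | 0.0 | 0 | Transient | 209.00 | 0 | 0 | Check-Out | 2017-09-05
119383 | City Hotel | 0 | 164 | 2017 | August | 35 | 31 | 2 | 4 | 2 | 0.0 | 0 | BB | DEU | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 42.0 | 0.0 | 0 | Transient | 87.60 | 0 | 0 | Check-Out | 2017-09-06
119384 | City Hotel | 0 | 21 | 2017 | August | 35 | 30 | 2 | 5 | 2 | 0.0 | 0 | BB | BEL | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 394.0 | 0.0 | 0 | Transient | 96.14 | 0 | 2 | Check-Out | 2017-09-06
119385 | City Hotel | 0 | 23 | 2017 | August | 35 | 30 | 2 | 5 | 2 | 0.0 | 0 | BB | BEL | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 394.0 | 0.0 | 0 | Transient | 96.14 | 0 | 0 | Check-Out | 2017-09-06
119386 | City Hotel | 0 | 102 | 2017 | August | 35 | 31 | 2 | 5 | 3 | 0.0 | 0 | BB | FRA | Online TA | TA/TO | 0 | 0 | 0 | E | E | 0 | No Deposit | 9.0 | 0.0 | 0 | Transient | 225.43 | 0 | 2 | Check-Out | 2017-09-07
119387 | City Hotel | 0 | 34 | 2017 | August | 35 | 31 | 2 | 5 | 2 | 0.0 | 0 | BB | DEU | Online TA | TA/TO | 0 | 0 | 0 | D | D | 0 | No Deposit | 9.0 | 0.0 | 0 | Transient | 157.71 | 0 | 4 | Check-Out | 2017-09-07
119388 | City Hotel | 0 | 109 | 2017 | August | 35 | 31 | 2 | 5 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 89.0 | 0.0 | 0 | Transient | 104.40 | 0 | 0 | Check-Out | 2017-09-07
119389 | City Hotel | 0 | 205 | 2017 | August | 35 | 29 | 2 | 7 | 2 | 0.0 | 0 | HB | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | 0.0 | 0 | Transient | 151.20 | 0 | 2 | Check-Out | 2017-09-07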

Duplicate rows

Most frequently occurring

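(index) | hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | # duplicates
5398 | City Hotel | 1 | 277 | 2016 | November | 46 | 7 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 0.0 | 0.0 | 0 | Transient | 100.0 | 0 | 0 | Canceled | 2016-04-04 | 180
4176 | City Hotel | 1 | 68 | 2016 | February | 8 | 17 | 0 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 37.0 | 0.0 | 0 | Transient | 75.0 | 0 | 0 | Canceled | 2016-01-06 | 150
5070 | City Hotel | 1 | 188 | 2016 | June | 25 | 15 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 119.0 | 0.0 | 39 | Transient | 130.0 | 0 | 0 | Canceled | 2016-01-18 | 109
4874 | City Hotel | 1 | 158 | 2016 | May | 22 | 24 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 37.0 | 0.0 | 31 | Transient | 130.0 | 0 | 0 | Canceled | 2016-01-18 | 101
3845 | City Hotel | 1 | 34 | 2015 | December | 50 | 8 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 1 | 0 | A | A | 0 | Non Refund | 19.0 | 0.0 | 0 | Transient | 90.0 | 0 | 0 | Canceled | 2015-11-17 | 100
3787 | City Hotel | 1 | 28 | 2017 | March | 9 | 2 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 0.0 | 0.0 | 0 | Transient | 95.0 | 0 | 0 | Canceled | 2017-02-02 | 99
3901 | City Hotel | 1 | 38 | 2017 | January | 2 | 14 | 0 | 1 | 1 | 0.0 | 0 | BB | PRT | Corporate | Corporate | 0 | 0 | 0 | A | A | 0 | Non Refund | 0.0 | 67.0 | 0 | Transient | 75.0 | 0 | 0 | Canceled | 2016-12-07 | 99
4867 | City Hotel | 1 | 156 | 2017 | April | 17 | 26 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 37.0 | 0.0 | 0 | Transient | 100.0 | 0 | 0 | Canceled | 2016-11-21 | 99
4200 | City Hotel | 1 | 71 | 2016 | June | 25 | 14 | 0 | 3 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 236.0 | 0.0 | 0 | Transient | 120.0 | 0 | 0 | Canceled | 2016-04-27 | 89
4934 | City Hotel | 1 | 166 | 2016 | November | 45 | 1 | 0 | 3 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 236.0 | 0.0 | 0 | Transient | 110.0 | 0 | 0 | Canceled | 2016-07-13 | 85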
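
A table of the most frequently repeated rows, like the one above, can be approximated with a pandas group-by over all columns. This is a minimal sketch under stated assumptions, not the profiler's actual code: the helper name `most_frequent_duplicates` and the two-column toy DataFrame are hypothetical, and the way the report derives its own duplicate figures may differ.

```python
import pandas as pd


def most_frequent_duplicates(df: pd.DataFrame, top: int = 10) -> pd.DataFrame:
    """Count how often each distinct row occurs; keep rows seen more than once."""
    counts = (
        df.groupby(list(df.columns), dropna=False)  # keep groups containing NaN
        .size()
        .reset_index(name="# duplicates")
    )
    repeated = counts[counts["# duplicates"] > 1]
    return repeated.sort_values("# duplicates", ascending=False).head(top)


# Hypothetical toy data, for illustration only.
toy = pd.DataFrame({
    "hotel": ["City Hotel"] * 3 + ["Resort Hotel"] * 2 + ["City Hotel"],
    "lead_time": [277, 277, 277, 68, 68, 5],
})
result = most_frequent_duplicates(toy)
print(result)

# Rows that repeat an earlier row, as counted by DataFrame.duplicated().
n_repeats = int(toy.duplicated().sum())
```

In this sketch the `# duplicates` column holds the total number of occurrences of each distinct row, while `DataFrame.duplicated()` flags only the second and later occurrences, so the two counts are not interchangeable.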